PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Thecc1EG006602t1
Common Name TCM_006602
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family bHLH
Protein Properties Length: 616aa    MW: 69077.3 Da    PI: 5.5404
Description bHLH family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG006602t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    HLH     36.8   7.1e-12  356    399  7          55
                       HHHHHHHHHHHHHHHHHHCTSCC.C...TTS-STCHHHHHHHHHHHHHHH CS
               HLH   7 erErrRRdriNsafeeLrellPk.askapskKlsKaeiLekAveYIksLq 55 
                        +Er+RR+++N+++  Lr+l+Pk +      Kl+ a+iL  A+e++k+Lq
  Thecc1EG006602t1 356 VAERKRRKKLNERLYALRSLVPKiS------KLDRASILGDAIEFVKELQ 399
                       68*********************66......******************9 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
Pfam             PF14215             1.7E-21   22     167  IPR025610    Transcription factor MYC/MYB N-terminal
PROSITE profile  PS50888             15.663    349    398  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
CDD              cd00083             3.61E-13  351    403  No hit       No description
SuperFamily      SSF47459            5.5E-17   352    412  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
SMART            SM00353             4.1E-16   355    404  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
Pfam             PF00010             1.7E-9    356    399  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
Gene3D           G3DSA:4.10.280.10   1.2E-16   356    412  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
CDD              cd04873             9.54E-4   496    559  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0009555  Biological Process  pollen development
GO:0048657  Biological Process  anther wall tapetum cell differentiation
GO:0005634  Cellular Component  nucleus
GO:0000978  Molecular Function  RNA polymerase II core promoter proximal region sequence-specific DNA binding
GO:0046983  Molecular Function  protein dimerization activity
Sequence
Protein Sequence    Length: 616 aa
MGVLRMSESL KVKMNIFQNL LERLRQLVGS KGWDYCVLWK LSDDQRFIEW MDCCCAGAEN  60
IESGGELQFP VTPVLPCRDV MFQHPKTKSC ELLAQLPSCM PLDSGSHAQT LISNQPKWLN  120
FSKNSDSNVL EEIVGTRILI PVAGGLVELF VAKQVCEDQN VVDYIATLCN ITLEQGGMMN  180
SSSMDAHVTV LNAQALNELQ PKHHLGNEDD QKDPTNHFQQ PVSLATTLET LNLPYDISSD  240
QIRSCNSPTN SLQQYNYLSE HKTKIDVYVE GSHDAFLPDH KVASPYNDNG LQEMDPLNSI  300
ITNESILIQG NDKDSIKQDN GRSDSMSDCS DQNDDEDDAR YQRRPGSKGP QSKNLVAERK  360
RRKKLNERLY ALRSLVPKIS KLDRASILGD AIEFVKELQN QVKELQDELE EHSDNDGSKK  420
TGLNGIHKNV QSEIFSQNEI AVDPNPEHDK GPNGFPVGGN GSVSKHKQDV EITSDKTQQM  480
EVQVEVAQID GNQFFVKVFC EHKPGGFVRL MEALDSLGLE VTNANVNSFR GLVSNVFKVE  540
IKDSEMVQAD HVRDSLLELT RNPSKGLSEM AKASENNNGI DCNYHKQQQQ LQHQLHNHHI  600
SSHHRHLHHF QKQLA*
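
The Start and End values reported for the HLH signature domain (356 to 399) can be checked against the protein sequence above. The following is a minimal sketch, not part of the database record: it assumes the coordinates are 1-based and inclusive, and it uses only residues 301-420 copied from the sequence block. The names `FRAGMENT`, `FRAGMENT_START`, `region`, and `ungapped` are hypothetical helpers introduced for illustration.

```python
import re

# Residues 301-420 of Thecc1EG006602t1, copied from the Protein Sequence block.
# The regex strips the grouping spaces and newlines, leaving residues only.
FRAGMENT_START = 301  # 1-based position of the first residue in FRAGMENT
FRAGMENT = re.sub(r"[^A-Z]", "", """
ITNESILIQG NDKDSIKQDN GRSDSMSDCS DQNDDEDDAR YQRRPGSKGP QSKNLVAERK
RRKKLNERLY ALRSLVPKIS KLDRASILGD AIEFVKELQN QVKELQDELE EHSDNDGSKK
""")


def region(start, end, seq=FRAGMENT, seq_start=FRAGMENT_START):
    """Return residues start..end (assumed 1-based, inclusive) from seq."""
    last = seq_start + len(seq) - 1
    if start < seq_start or end > last or start > end:
        raise ValueError("coordinates fall outside the fragment")
    return seq[start - seq_start : end - seq_start + 1]


def ungapped(aligned):
    """Drop gap characters from an alignment row and upper-case the residues."""
    return re.sub(r"[^A-Za-z]", "", aligned).upper()


# Query row of the HLH alignment in the Signature Domain section.
aligned_row = "VAERKRRKKLNERLYALRSLVPKiS------KLDRASILGDAIEFVKELQ"

hlh = region(356, 399)
print(hlh, len(hlh), hlh == ungapped(aligned_row))
```

Slicing a fragment instead of the full 616-residue sequence keeps the example short; the same `region` call works on the full sequence with `seq_start=1`.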
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    357    364  ERKRRKKL
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX591220  9e-48    JX591220.1 Gossypium hirsutum clone NBRI_GE26295 microsatellite sequence.
Annotation -- Protein
Source     Hit ID                  E-value  Description
Refseq     XP_007041798.1          0.0      Basic helix-loop-helix DNA-binding superfamily protein, putative
Swissprot  Q9ZVX2                  1e-158   AMS_ARATH; Transcription factor ABORTED MICROSPORES
TrEMBL     A0A061DXZ8              0.0      A0A061DXZ8_THECC; Basic helix-loop-helix DNA-binding superfamily protein, putative
STRING     VIT_01s0127g00860.t01   0.0      (Vitis vinifera)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM7589              27           41
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G16910.1  1e-144   bHLH family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]